CrowdFormer: Weakly-supervised crowd counting with improved generalizability
نویسندگان
چکیده
Convolutional neural networks (CNNs) have dominated the field of computer vision for nearly a decade. However, due to their limited receptive field, CNNs fail model global context. On other hand, transformers, an attention-based architecture, can context easily. Despite this, there are studies that investigate effectiveness transformers in crowd counting. In addition, majority existing crowd-counting methods based on regression density maps which requires point-level annotation each person present scene. This task is laborious and also error-prone. has led increased focus weakly-supervised methods, require only count-level annotations. this paper, we propose method counting using pyramid transformer. We conducted extensive evaluations validate proposed method. Our achieves state-of-the-art performance. More importantly, it shows remarkable generalizability.
منابع مشابه
Improved Techniques For Weakly-Supervised Object Localization
We propose an improved technique for weakly-supervised object localization. Conventional methods have a limitation that they focus only on most discriminative parts of the target objects. The recent study addressed this issue and resolved this limitation by augmenting the training data for less discriminative parts. To this end, we employ an effective data augmentation for improving the accurac...
متن کاملCounting with the Crowd
In this paper, we address the problem of selectivity estimation in a crowdsourced database. Specifically, we develop several techniques for using workers on a crowdsourcing platform like Amazon’s Mechanical Turk to estimate the fraction of items in a dataset (e.g., a collection of photos) that satisfy some property or predicate (e.g., photos of trees). We do this without explicitly iterating th...
متن کاملWeakly supervised parsing with rules
This work proposes a new research direction to address the lack of structures in traditional n-gram models. It is based on a weakly supervised dependency parser that can model speech syntax without relying on any annotated training corpus. Labeled data is replaced by a few hand-crafted rules that encode basic syntactic knowledge. Bayesian inference then samples the rules, disambiguating and com...
متن کاملUnsupervised Crowd Counting
Most crowd counting methods rely on training with labeled data to learn a mapping between image features and the number of people in the scene. However, the nature of this mapping may change as a function of the scene, camera parameters, illumination etc., limiting the ability of such supervised systems to generalize to novel conditions. Here we propose an alternative, unsupervised strategy. Th...
متن کاملWeakly-supervised appraisal analysis
This article is concerned with the computational treatment of Appraisal, a Systemic Functional Linguistic theory of the types of language employed to communicate opinion in English. The theory considers aspects such as Attitude (how writers communicate their point of view), Engagement (how writers align themselves with respect to the opinions of others) and Graduation (how writers amplify or di...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Visual Communication and Image Representation
سال: 2023
ISSN: ['1095-9076', '1047-3203']
DOI: https://doi.org/10.1016/j.jvcir.2023.103853